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Abstract 

Prior knowledge can improve the performance of spectrum sensing. Instead of using universal features as prior 
knowledge, we propose to blindly learn the localized feature at the secondary user. Motivated by pattern recognition 
in machine learning, we define signal feature as the leading eigenvector of the signal's sample covariance matrix. 
Feature learning algorithm (FLA) for blind feature learning and feature template matching algorithm (FTM) for 
spectrum sensing are proposed. Furthermore, we implement the FLA and FTM in hardware. Simulations and 



hardware experiments show that signal feature can be learned blindly. In addition, by using signal feature as 
prior knowledge, the detection performance can be improved by about 2 dB. Motivated by experimental results, 
we derive several GLRT based spectrum sensing algorithms under rank-1 assumption, considering signal feature, 
signal power and noise power as the available parameters. The performance of our proposed algorithms is tested 
on both synthesized rank-1 signal and captured DTV data, and compared to other state-of-the-art covariance matrix 
based spectrum sensing algorithms. In general, our GLRT based algorithms have better detection performance. In 
addition, algorithms with signal feature as prior knowledge are about 2 dB better than algorithms without prior 
knowledge. 

Index Terms 

Spectrum sensing, cognitive radio (CR), generalized likelihood ratio test (GLRT), hardware. 

The authors are with the Wireless Networking System Lab in Department of Electrical and Computer Engineering, Center for 
Manufacturing Research, Tennessee Technological University, Cookeville, TN, 38505, USA. E-mail: pzhang21@students.tntech.edu, {rqiu, 
nguo}@tntech.edu 



2 

I. Introduction 

Radio frequency (RF) is fully allocated for primary users (PU), but it is not utilized efficiently Q. 0, 
||3l show that the utilization of allocated spectrum only ranges from 15% to 85%. This is even lower in rural 
areas. Cognitive radio (CR) is proposed so that secondary users (SU) can occupy the unused spectrum from 
PU, therefore improving the spectrum efficiency and enabling more RF applications. Spectrum sensing is 
the key function in CR. Each SU should be able to sense PU's existence accurately in low signal-to-noise 
ratio (SNR) to avoid interference. 

Spectrum sensing can be casted as the signal detection problem. The detection performance depends 
on the available prior knowledge. If the signal is fixed and known to the receiver, matched filter gives 
the optimum detection performance [|4)-[]6). If signal is unknown, signal samples can be modeled as 
independent and identically distributed (i.i.d.) random variables, as well as noise. In such model, energy 
detector gives the optimum performance. However, though energy detector is blind to signal, it is not 
blind to noise. show that actual noise power is not obtainable and noise uncertainty problem can 
heavily limit energy detector's performance. In addition, signal is usually oversampled at the receiver, and 
non-white wide-sense stationary (WSS) model is more appropriate for signal samples. 

Prior knowledge of PU signal is often considered in spectrum sensing algorithms. One class of spectrum 
sensing algorithms utilize prior knowledge from universal pre-determined signal spectral information. 
Take spectrum sensing algorithms for DTV signal for example. Pre-determined spectral information 
includes pilot tone (3), spectrum shape [0 and cyclostationarity [fTDl , etc. Generally speaking, they have 
good performance when it is assumed that those pre-determined features are universal. However, such 
assumption is not true in practice. From IEEE 802.22 DTV measurements [fTTI as shown in Fig. \T\ spectral 
features are location dependent due to different channel characteristics and synchronization mis-match, etc. 
Therefore, we cannot rely on universal pre-determined signal features for spectrum sensing. Furthermore, 
these non-blind algorithms are only limited to DTV signals. In this paper, we propose to use localized 
signal feature learned at SU for spectrum sensing, so that feature can be location dependent. Motivated 
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Fig. 1. Spectrum measured at different locations in Washington D.C.. Left: 'Single Family Home'; Right: 'Apartment (High-Rise)'. The 
pilot tones are located at different frequency locations. Two spectrum suffer different frequency selective fading. 

from pattern recognition in machine learning [H2l . we define the signal feature as the leading eigenvector 
of signal's sample covariance matrix. According to DKLT |[T3l . |fl4l . there are two interesting properties: 

1) The leading eigenvector is stable over time for non-white WSS signal while random for white noise. 

2) The leading eigenvector for non-white WSS signal is most robust against white noise. 

Using these properties, we develop the feature learning algorithm (FLA) for blind feature learning and 
the feature template matching (FTM) algorithm using blindly learned signal feature for spectrum sensing. 
Noise uncertainty problem is avoided because actual noise power is not used in spectrum sensing. 

We blindly measure the feature similarity of the 25 seconds DTV data samples captured in Washington 
D.C. every 4.6 ms. Surprisingly, all features are almost exactly the same. In addition, simulation results will 
show that the detection performance of FTM can be improved by 2 dB, compared with other covariance 
matrix based algorithms without any prior knowledge. 

Motivated by the above simulation results, we have a patent disclosure for both algorithms [15] and 
verify them using Lyrtech hardware platform with FPGA and DSP IfTBI , IfTTl . We have spent 2 man-months 
to implement FLA and FTM in Lyrtech hardware platform. We have performed a blind feature learning 
experiment in non-line-of-sight (NLOS) environment showing feature's stability over time. Moreover, we 
compare the detection performance of FTM and CAV in hardware as well. In the experiment, FTM is 
about 3 dB better than CAV without any prior knowledge. 

We further develop various algorithms using the general likelihood ratio test (GLRT) method with signal 
feature as one of the available parameters. Unfortunately, close form results for GLRT are not always 
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TABLE I 

Cases Considering Available Parameters Under Rank- 1 Assumption 





kjlglld.1 r UWC1 


Mai cti Pr\\i7*ar 
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Case 1 


Yes 


Yes 


Yes 


Case 2 


No 


Yes 


Yes 


Case 3 


No 


No 


Yes 


Case 4 


No 


Yes 


No 


Case 5 


No 


No 


No 



obtainable [fT8l . For mathematical convenience, we derive GLRT algorithms under rank-1 assumption. 
There are three parameters for analysis: signal power, noise variance and signal feature. It is very 
convenient to perform analysis under rank-1 GLRT framework. We derive GLRT algorithms for 5 cases 
listed in Table HI We use both rank-1 signal and captured DTV signal for simulation. Though DTV signal is 
not rank-1, simulation results have shown relative very good detection performance for our derived GLRT 
algorithms. Overall, among algorithms without noise uncertainty problem, our GLRT based algorithm in 
Case 3 and FTM with signal feature as prior knowledge is about 2 dB better than algorithms without prior 
knowledge. Interestingly, Case 3 with feature as prior knowledge is only slightly better than FTM, within 
0.1 dB, though FTM has much lower computational complexity. In addition, our GLRT based algorithm 
in Case 5 is slightly better than AGM, which is the counterpart algorithm for Case 5 derived in [19], [1201 
without rank-1 assumption. 

Our algorithm derivations are different with those in [1191 , EDI in that: 

1) For the first time, we use the properties of the eigenvector, not eigenvalue, for spectrum sensing. 

2) For the first time, we analyze the problem with signal feature as one of the parameters using GLRT 
method. 

3) We derive GLRT based algorithms under rank-1 assumption. 

During the preparation of this paper, we notice that [21] has also derived several GLRT based algorithms 
using the latest results from the finite-sample optimality of GLRT 11221 . ||2T1 has shown interesting results 



by introducing prior distribution of unknown parameters to obtain better performance, compared with 
classical GLRT. We will extend our current work based on the methods in ETTl . E2l . 

The organization of this paper is as follows. Section [n] describes the system model. Section Hill reviews 
the blind feature learning results from our previous work. We present our proposed GLRT based spectrum 
sensing algorithms in Section [IV] Simulation results are shown in Section |V] and conclusions are made 
in Section |VU 

II. System Model 

We consider the case when there is one receive antenna to detect one PU signal. Let r (t) be the 
continuous-time received signal at receiver after unknown channel with unknown flat fading, r (t) is 
sampled with period T s , and the received signal sample is r[n]=r (nT s ). In order to detect PU signal's 
existence, we have two hypothesis: 

H : r[n] = w [n] 

' (1) 
Hi : r [n] = s [n] + w [n] 

where w [n] is the zero-mean white Gaussian noise, and s [n] is the received PU signal after unknown chan- 
nel and is zero-mean non-white WSS Gaussian. Two probabilities are of interest to evaluate detection per- 
formance: Detection probability, P d (Hi\r [n] = s [n] + w [n]) and false alarm probability Pf {H\\r [n] = w [n 

Assume spectrum sensing is performed upon the statistics of the i th sensing segment r r i consisting of 
N s sensing vectors: 

r r ,i = {r(j_i)7v s +i, r(j_i)AT a+2 , • ■ ■ r(j_i)Ar s+ 7v s | (2) 

with 

r 4 = [r\i),r[i + l],--- ,r{i + N-l]] T (3) 

where (-) T denotes matrix transpose, ~ jV(0,R r ), and R r can be approximated by sample covariance 
matrix R r : 
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We will use R r instead of R r for convenience. The eigen-decomposition of R r is: 



JV 



(5) 



i=l 



where 



V,l 



V,2 



V,JV 



(6) 



and 



A r — diag {X r ,i, K,2, • • • , \-,at} 



(7) 



(iiag{-} denotes the diagonal matrix, {4> r ,i) are eigenvectors of R r and {\ r ,i} are eigenvalues of R r , 
satisfying A r> i > A r> 2 > ... > A ri Ar. Accordingly, we have R s , $ s and A s for s^; H w = a 2 I for w i5 where 
I is identity matrix. 

One practical issue is that noise w [n] after analog-to-digital converter (ADC) is usually non-white, 
due to RF characteristics. A noise whitening filter is commonly applied before ADC and the details can 
be found in [|23l . In this paper, r [n] can be viewed as received sample after the noise whitening filter. 
Therefore, w [n] is white and s [n] has taken noise whitening filter into account. In the rest of this paper, 
all noise is considered as white. 



In this section, we will briefly review our previous blind feature learning results. 

DKLT gives the optimum solution in searching signal subspace with maximum signal energy, which are 
represented by the eigenvectors [13], |[T4l . The leading eigenvector, a.k.a. feature, has maximum signal 
subspace energy, which is the leading eigenvalue. Moreover, feature is robust against noise and stable if the 
signal samples are non-white WSS. If signal samples are white, feature is random. This can be illustrated 
in the following way. Geometrically, feature is the new axes with largest projected signal energy [fT2l . 
[fT3l . Let x s be a 2 x 1 zero-mean non-white Gaussian random vector and x n be a 2 x 1 zero-mean white 
Gaussian random vector. x s and x n have same energy and let x s+n = x s + x n . We plot 1000 samples of 



III. Blind Feature Learning 
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Fig. 2. Feature of non-white signal is robust. Feature of white noise has randomness. 

x s , x n and x s+ „ on a two dimensional graph in Fig. |2] It can be seen that new X axes, a.k.a. feature, 
of x s and x s+n are exactly the same, while feature of x n is rotated with some random degree. We can 
use this property to differentiate non- white WSS signal s [n] and noise w [n). Let N x 1 vector (pi be the 
extracted feature from the covariance matrix of sensing segment Y r>i . We obtain two consecutive features 
ifi and <p%+i from T r ^ and T r ^ i+ i, respectively. If (pi and <pj are highly similar, then the signal feature is 
learned. We use the intuitive template matching method to define the similarity of ifi and iff 

N 

= l=1 2 mJ f _ fe+1 I Yl Pi M <P3 l k + l }\ ( 8 ) 
' k=l 

The FLA is outlined as follows: 

1) Extract features ipi and <f>i+\ from two consecutive sensing segments r r i and r r i+ i. 

2) Compute similarity p^+i between these two features using ([8]). 

3) If Pi,i+i > T e feature is learned as </> S)1 = (pi + i, where T e is the threshold that can be determined 
empirically. 

We use captured DTV signal in Washington D.C. with 25 seconds duration [11] to illustrate that feature 
is robust and stable over time. We measure feature similarity of consecutive sensing segments over the 
25-second data with 4.6 ms per sensing segment. It is surprising that the feature extracted from the first 




sensing segment and the last sensing segment has similarity as high as 99.98%. Furthermore, signal feature 
is almost unchanged for about 99.46% amount of time in 25 seconds. Signal feature is very robust and 
stable over time. 

With learned signal feature </> s>1 as prior knowledge, we develop the intuitive FTM for spectrum sensing. 
FTM simply compare the similarity between the feature (f) r> \ extracted from the new sensing segment r n f 
and the signal feature (f> s>1 . If </> r l and (p Sjl are highly similar, PU signal exists. The FTM is outlined as 
follows: 

1) Extract feature <p r>i from sensing segment r r ti . 

2) "Hi is true if: 



Tftm = max 

i=l,2,...,iV-fc+l 



N 



fc=i 



>7 (9) 



where 7 is the threshold determined by desired Pf. 

Simulation results of FTM on DTV signal will be shown in Section |VJ Approximately 2 dB gain will 
be obtained over algorithms without any prior knowledge. 

We have implemented the FLA, FTM in Lyrtech software-defined-radio (SDR) hardware platform. 
Another spectrum sensing algorithm based on sensing segment T Tji , CAV [|23l , is also implemented 
in the same hardware as well. CAV uses exactly the same signal as FTM, but CAV does not require 
any prior knowledge. It is considered as a blind benchmark algorithm for comparison purpose. The 
top-level architecture is illustrated in Fig. |3] Covariance matrix calculation is implemented in Xilinx 
Virtex 4 SX35 FPGA; feature extraction (leading eigenvector calculation), similarity calculation and 
CAV are implemented in TI C64x+ DSP. Leading eigenvector calculation is the major challenge in our 
implementation. Since FLA and FTM only use the leading eigenvector for feature learning and spectrum 
sensing, we use FPCA E4l and the computational complexity is reduced from 0(N 3 ) to 0(N 2 ). Without 
much effort in implementation optimization, the leading eigenvector can be extracted within 20 ms. We 
perform the blind feature learning experiment in an NLOS indoor environment. PU signal is emulated 
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Fig. 3. The top-level architecture of the spectrum sensing receiver. DCM: Digital conversion module. DPM: Digital processing module. 

by sinusoidal generated from Rohde & Schwarz signal generator. Transmit antenna and receive antenna 
are 2 meters away, and the direct patch is blocked by the signal generator. A —50 dBm sinusoidal signal 
at 435 MHz is transmitted. SU's RF is tuned to 432 MHz center frequency with 20 MHz bandwidth. 
Channel, signal frequency, signal power and noise power are unknown to the receiver. In the experiment, 
our hardware platform record the feature similarities of consecutive sensing segments around every 20 ms 
for 20 seconds with N s = 2 20 and N = 32. By setting T e = 80%, p i>i+1 > T e for 87.6% amount of time 
when PU signal exists. The similarity of features extracted from the first segment and the last segment is 
94.3%. As a result, signal feature in this experiment is very stable and robust over time. 

Then, we set the feature extracted from the last sensing segment as learned signal feature and 
perform the spectrum sensing experiment. FTM is compared with CAV 11231 , which is totally blind. In 
order to compare the detection performance of both algorithms under the same SNR, we connect the signal 
generator to the receiver with SMA cable. PU feature is already stored as S>1 at the receiver. We vary 
the transmit power of the signal generator from —125 dBm to —116 dBm with 3 dB increments. Cable 
loss is omitted and transmit signal power is considered as received signal power. 1000 measurements are 
made for each setting. The Pd VS Received Signal Power curves at Pf = 10% for FTM and CAV are 
depicted in Fig. |4j It can be seen that to reach Pd m 100%, the required minimum received signal power 
for CAV is at least 3 dB more than FTM. The above hardware experiments show that feature is very 
robust and stable over time, and feature can be learned blindly. We can use feature as prior knowledge 
to obtain better spectrum sensing performance. 
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Fig. 4. P d VS Received Signal Power at P f = 10% for FTM and CAV. 

IV. Detection Algorithms 

We try to derive GLRT based algorithms considering feature as one of the available parameters. In this 
paper's GLRT based algorithms, signal is detectable iff. X Sl i > o 2 . Signal detection under low energy 
coherence (LEC) condition (A Sj i < a 2 ) ll25Tl is not considered. All algorithms requiring a 2 as prior 
knowledge have noise uncertainty problem, because the actual a 2 is not obtainable 0. 



A. GLRT Based Detection Algorithms 

1 ) Background Review: Since s [n] and w [n] are uncorrected, the distribution of received signal vector 
Yi under two hypothesis can be represented as: 



and 



Ho-.r^Af^a 2 !) 



U x :Yi~ JV(0, R s + ct 2 I) 



(10) 



(ID 



The detection will be based upon the statistics of N s sensing vectors in T r)i , say, r r i. If {r^} are i.i.d., 
we have: 



N s 

p(r r ,i Wo) = II p(ri\Ho) 

i=i 

N s 

p{Y r>x \U x ) = np(r^i) 
i=i 



(12) 



Though {r,j} defined in © are not i.i.d., we will use f|T2]) for mathematical convenience. The likelihood 
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function for T r 1 under % condition can be: 



p(T r , 1 \n ) = Ylp(r t \n ) 

i=l 

i=l (27T(T 2 ) 1 



(13) 



and the corresponding logarithm likelihood function is: 

NK 



lnp(r r .i|"Ho) 



The logarithm likelihood function under Hi is: 



In (2tt(j 2 ) 



2a 2 



E 

i=i 



T 



(14) 



p(r,,il^i) = Ilp( r ^i) 



i=i 



= n 



AT l 

i (27r)~det2 (R s + a 2 l) 



cxp 



^rf(R s + <7 2 l) \i 



(15) 



and the corresponding logarithm likelihood function is: 



lnp(r r i|Wi) 



-m27r- - 

2 2 



AT AT S N 

a^EM^ + ^ + EE 



(16) 



: j =1 i=l ^s,i + °" 2 

In signal detection, it is desired to design an algorithm maximizing the P d for a given Pf. According 
to Neyman-Pearson theorem, this can be done by the likelihood ratio test (LRT): 

p(r r ,i|Wi) 



L (r r>1 ) 



p(r r ,i|^o) 



(17) 



or 



lnL(r rj i) = lnp(r ri i|"Hi) - lnp(r ri i|"Ho) 



(18) 



"Hi is true if L (r r l ) or InL (r r l ) is greater than a threshold 7, which is determined by desired Pf. 

In practice, however, it is usually not possible to know the exact likelihood functions. If one or several 
parameters are unknown, composite hypothesis testing is used. GLRT is a common method in composite 
hypothesis testing problems. GLRT first gets a maximum likelihood estimate (MLE) of the unknown 
parameters set © under Ho and Hi'. 



O = argmaxp(r ri i|0 o ,%o) 



©0 



(19) 



©i = argmaxp(r r i|0i,%i) 
©1 
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where © and ©! are the unknown parameters under T-L and Hi, respectively. Hi is true if: 



p(r rl \S 1 ,'H x 

Lg r P)1 ) = ) . f > 7 



p 



(r rj i|©o, H( 



(20) 



or 



lnL G (r rjl ) =lnp(r r) i|8 1 ,?{i) - hip (r r . i|© , H ) >7 



(21) 



Unfortunately, sometimes closed-form solutions for GLRT cannot be derived directly [18]. For math- 
ematical convenience, we will assume the signal covariance matrix to be rank-1 matrix. According to 
DKLT fT3~ll . the optimum rank-1 approximated matrix for R s is 



(22) 



There are three parameters available under rank-1 assumption: A S) i, cr 2 and Sj i. Notice that signal feature 
(f) S) i is also one of the parameters. Therefore, it is very convenient to analyze our feature based spectrum 
sensing algorithm under the rank-1 GLRT framework. We list the algorithms correspondent to different 
combinations of available parameters in Table HI Case 1 is for upper benchmark reference assuming all 
parameters known. Except for Case 1, we do not consider A Sj i as prior knowledge, because it is impractical 
to assume the signal energy of PU as prior knowledge. 
Under rank-1 assumption, only A S) i ^ and (fT6l ) becomes: 



NN. 



1 9 N " 

m2ir 

2 2 



N s 



In A Sjl + a 2 +]T 



T 

s,l r j 



\ N s (A.,i + a 2 



2 



(^-l)ln(a 2 )+EE 



s,i 3 



3=1 i=2 



iV> 2 



(23) 



N 

E 

i=i 



Since $ fl $^ = £ <j> a ,i<f>^i = I, we have: 



N 

£< 

i=2 



K^li 



(24) 



13 



With ®, E (<& 7 ) in d2l becomes: 



i=2 



AT 

E(<fe)' 



' TV 

• i E- 



T 



■■,i<P S A 



j=2 



\«=2 / 

if (I - r 3 



r J r i ~ K!i r i 



Notice that: 



N s 



1 — > r, 

r r = R 

AT,, ^ 3 3 • 



Y r < r 

s j=1 



1 



s j=i 

N 

= E ^r,i 

4 = 1 

^ E (fe) 2 = -jftli E r i r i0.,i 

S j = \ S jr" = l 



tVs,1 



Together with (|27t and (fT4T) . we have: 

lnp (r r> i |"H ) : 
Together with (|2~7T> . (1281) and d23]), we have: 



1 w 

iVln (2^) + — Y, K,i 
V 7 a i=i 



lnp(r r) i|"Hi 



A/W s , A^ 

Mn27r 

2 2 



In (A m + 



2 \ , ''•''II H. r O, | 



A. i + a* 



N. 
2 



(AT - 1) In (a 2 ) + 



' N 

E 

vi=l 



(25) 



(26) 



(27) 



(28) 



(29) 



E K,i — 0f iR-r0s,l 



(30) 



We will use (|29l ) and (1301 ) extensively to derive GLRT based algorithm considering 5 cases in Table U 
2) Case 1: All parameters available: In this case, we have the classical estimator-correlator (EC) test 
HI. U x is true if: 



T E c = Y r J K s{ K s + ° 2 l) Tj 



(3D 
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Details of this derivation can be found in [fT8l , using eigen-decomposition properties. 

Under rank-1 assumption, we can get the new test by replacing R s with R* in (1311) and only A S) i 7^ 0. 
By ignoring corresponding constants, Hi is true if: 



■ CAS El 



A s ,i 7 



(32) 



5) Case 2: a 2 and <j> 8) i available: In this case, we need to get MLE of A S) i. By taking the derivative 
to (1301) with respect to A S) i, we have: 



dlnp(r rjl \\ sA ,Ui) 



Let 



«9A Sjl 2 
d]np(T ril \x s ,i,Hi) = and we have MLE of A M : 



^\R r s ,l 



(33) 



d\ s ,1 



A S ,1 = s .iR r 0s,l - 



(34) 



Together with (12TI) . (|29l ), (1301 and (|34l ) and ignoring the constants, we can get the test for Case 2. Hi 
is true if: 



T C ASE2 = (fi s iRr0s,l > 7 



(35) 



where 7 depends on the noise variance a 2 . 

4) Case 3: </> s 1 available: This is the case when only signal feature is known. We need to get MLE 
of X s 1 and a 2 . By taking the derivative to (1291 ) with respect to cr 2 , we have: 



dlnp(r ril \a 2 ,H ) 
da 2 



N 



<7 



N 

(a 2 ) 2 



Let = and we have the MLE of a 2 under Ho'- 



da 2 



N 



^0 = E K,i/N 



i=i 



By taking the derivative to (1301) with respect to A Sj i, we have: 



dhip (r r!l \\ sA , a 2 ,Hi 
dX s ,i 



1 



2 VA s ,i + ^ 2 (A M + <x 2 )' 



(36) 



(37) 



(38) 
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Let 51 — £ = and we have 

OA..1 



A s ,i + o"i 



^R 



(39) 



Then, by taking the derivative to (1301 with respect to a 2 , we have: 

din^rviA^a 2 ,^) 



2 



9a 2 
1 



^\R r Si l 



Ki + ° 2 (A S> 1 + a 2 ) 



2 



N — 1 



a"- 



(° 2 ) 2 Vi=l 



N 

H Ar,i 



(40) 



Let 91np ( r7, ' 1 l A ^ 1 ' g ' '^0 _ q an( j together with (l39l , we have 



/ TV 
Vi=l 



?1 



R r </> Sil / (N - 1) 



(41) 



<t{ can be interpreted as the average energy mapped onto the non- signal subspace. 

Together with (|37l ), (BTT l. (139b and (I2TT) . we can get the test for Case 3. Therefore, Hi is true if: 



Tcasez — m ■ 



+ (A^-l)ln4> 7 



(42) 



Vs,l^^,l 

where <r 2 and a\ are represented in (13Tb and (HTT i. 

5) Case 4: a 2 available: In this case, we need to get MLE of A Sj i and Sj i. The logarithm of the 
likelihood function under Hq is (|29l , which can be used directly for the likelihood ratio test. 

By taking the derivative to (|30l) with respect to A Sj i, we have similar result but with known a 2 and the 
estimate of </> Sjl : 

A s ,i + a 2 = (43) 
MLE finds <f) S) i that maximize lnp (T r> i\(f) Sj i, Hi) in (l30l) . (|30l) can be rewritten as: 

\np(T rA \X s> i,(f) S7 i,Hi) = ( - — - — g - \ ] 0LR r s ,i + 5- (a 2 ,A 



s,l 



(44) 



where g (a 2 , A Si i) is the function including all other terms in (130T > . 

Since — ~ ( x ^ +a , 2 — -^j > 0, hip (r r> i | A S) i, <fi s ,i, Hi) is monotonically increasing with regard to ^^R 
The MLE of </> s i is the solution to the following optimization problem: 



arg max 

4>8,l 



(45) 



S.t. (t^i4>s,l = i 
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The solution can be found by Lagrange multipliers method. Let 

/ (0 s ,i> <*) = 0^i R r0 s ,i + « (0^i0 s ,i - l) (46) 
Let the derivative to / (0 s ,i) with respect to S) i and a be zero respectively: 

R r Sj i = i 



(47) 



Therefore, <f> S) i is the leading eigenvector of R r and a is the leading eigenvalue of R r . The MLE of (f> s> \: 

4>s,l = <f> r ,l (48) 

With @7), ©, @3j), © and (ED, we have 

T case4 = - In - 1 (49) 
Since function / (x) = x — lnx — 1 is monotonically increasing with regard to x, Hi is true if: 

TcASEi = K,i > 7 (50) 

where 7 depends on the noise variance a 2 . 

It is interesting that (|50l) is essentially the same as the signal-subspace eigenvalues (SSE) in lfT9l under 
rank-1 assumption. Ignoring the constant terms in SSE, Hi is true if: 

N' N' 

T SSE = ^^-\n^^> 1 (51) 

where N' corresponds to the largest i such that A r ,j > a 2 . If signal is rank-1, N' can be or 1. If N' = 0, 
Ar,i < a 2 . If N' = 1, SSE becomes: 

Tssei = —5- - In — (52) 

Since (|52|) is monotonically increasing with regard to A r) i and cr 2 is constant, the test can be further 
simplified as: 

Tssei = K ,1 (53) 



As a result, no matter iV' = or 1, the test statistic will be A r i, which is the same as ([50 
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6) Case 5: All parameters unavailable: In this case, we need to get MLE of A Si i, a 2 and </> Sil . By 
taking the derivative to (|29l ) with respect to a 2 , we have MLE of a 2 under H in (1371) . Using similar 
techniques in Case 3 and Case 4, we have the following MLE of A S) i, a 2 and </> S)1 : 

TV 



<*o = E A r ,i/JV, 

i=l 
JV 



(54) 



and is true if: 



*? = EA M /(7Y-i), 

i=2 

iV 

A s ,i = A rjl - E K,i/ (N - 1) 

i=2 

4>sA = 0r,l 



^2 ^2 

Tcases = In ^ + (N - 1) In § > 7 (55) 



5. Covariance Matrix Based Algorithms 

Sample covariance matrix based spectrum sensing algorithms have been proposed. MME [26] and 
CAV [23] have no prior knowledge, while FTM [27] has feature as prior knowledge. Another interesting 
algorithm is AGM [fT9l . [|20l . which is derived using dT2]) without considering the rank of R s and prior 
knowledge. We call these algorithms covariance based because the first step of all these algorithms is to 
calculate the sample covariance matrix R r from F ri . 

1) MME: MME is also derived using £[2]). Hi is true if: 



T M me = > 7 (56) 

2) CAV: Ux is true if: 

N N 

E E hj\ 

Tcav = '^f 1 > 7 (57) 

E \ru\ 

i=l 

where are the elements of R r . 



3) FTM: FTM has been introduced in ©. 
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TABLE II 

Summary of the Algorithms for Simulation 







Pnii nl~i c\r\ 
XLU UclLlUll 


.F 1 1(J1 JV1HJ W ICUgv 




Tec 


ifTTK 


R a 2 

XV S , o 


Case 1 


Tc AS El 








TqASE2 






Case 3 


TcASE3 




eh i 


Case 4 


TcASE4 




0-2 


Case 5 


J O Ao Eo 


V-J-JV 


None 


MME 


Tmme 


(Hi 


None 


CAV 


Tcav 




None 


FTM 


Tftm 




0s, 1 


AGM 


Tagm 


Eli 


None 



4) AGM: AGM is derived without considering the rank of original signal. Hi is true if: 



i N 



i=i 



AGM 



- > 7 



(58) 



N \N 

n A r)i 

vi=l / 



Among all algorithms without noise uncertainty problem, CAV do not need any eigen-decomposition 
at all and has lowest computational complexity. FTM only needs to calculate <f) T) i and this can be done 
using fast principal component analysis (F-PCA) 11281 with computational complexity O (N 2 ). To the best 
of our knowledge, only CAV and FTM have been implemented and demonstrated in hardware platforms 
successfully. 



V. Simulation Results 

All algorithms to be simulated are summarized in Table II. EC uses the original R s , Case 1 - Case 
5 uses the algorithms under rank-1 assumption. Both Case 3 and FTM have the signal feature as prior 
knowledge. Case 5, MME, CAV and AGM have no prior knowledge. Note that EC, Case 1, Case 2 and 
Case 4 have noise uncertainty problem, because the tests depends on the actual a 2 . Case 3, Case 5, MME, 
CAV, FTM and AGM, however, do not have noise uncertainty problem, because their tests do not depend 
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Fig. 5. Algorithms with prior knowledge. Pd at various SNR levels with Pf = 10%, using rank-1 signal. 

on the actual a 2 . For each simulation, zero-mean i.i.d. Gaussian noise is added according to different 
SNR. 1000 simulations are performed on each SNR level and all algorithms are applied on the same 
noisy samples for each simulation. 

A. Simulation with Rank-1 Signal 

We first use simulated WSS rank-1 signal samples to perform Monte Carlo simulation. We use N s = 10 5 
samples to obtain rank-1 R s with N = 32. Signal feature Sj i is obtained from R s . Since R s is rank-1 
matrix, EC is equivalent to Case 1. Fig. [5] shows the Pd VS SNR plot with Pf = 10% for algorithms 
with prior knowledge while Fig. |6] shows the Pd VS SNR plot with Pf = 10% for algorithms without 
prior knowledge. From the simulation results, we can see that our derived GLRT based algorithms under 
rank-1 assumption work very well. To reach Pd ~ 100%, EC requies about -24 dB SNR. It can be seen 
that Case 2 has almost the same performance with Case 1. This is because A Sj i/ (A S) i + a 2 ) in (l32l is 
constant if a 2 is stable and true to the detector, a.k.a., no noise uncertainty problem. As a result, (l32l 
and (|35l) are using the same statistics and they are essentially equivalent. Case 3 with feature as prior 
knowledge is about 2 dB better than Case 4 with a 2 as prior knowledge. Interestingly, the intuitive FTM 
is only slight worse than Case 3, though computational complexity for FTM is much lower than that of 
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Fig. 6. Algorithms without prior knowledge. Pd at various SNR levels with P; — 10%, using rank-1 signal. 

Case 3. Case 5 is slightly worse than Case 4, within 0.1 dB. Case 5 is about 1 dB better than MME, and 
1.5 dB better than CAV. AGM, however, does not have comparable performance with other algorithms 
for rank- 1 signal when SNR is low. 

Overall, among all algorithms without noise uncertainty problem, Case 3 and FTM with feature as prior 
knowledge are about 2 dB better than other algorithms when no prior knowledge available. Our derived 
GLRT based algorithm in Case 5 has best performance among all algorithms without prior knowledge. 

B. Simulation with Captured DTV Signal 

Now we use one sensing segment of DTV signal captured in Washington D.C. with N s = 10 5 and 
N = 32 to test all algorithms. 

We first examine the rank of the signal. The normalized eigenvalue distribution of R s is plotted in Fig. 
[7] It is obvious that the rank of R s is greater than 1. 

Then, we perform the Monte Carlo simulation to test the detection performance of all algorithms. 
Simulation results are shown in Fig. |8]for algorithms with prior knowledge while Fig. |9] shows the results 
for algorithms without prior knowledge. Both figures use Pd VS SNR plot with Pf = 10%. We can see 
that for DTV signal, all algorithms do not work as good as they are for the rank-1 signal. To reach 
Pd ~ 100%, EC requires about -20 dB SNR. It can be seen that Case 1 using H] is about 0.1 dB worse 
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Index 



Fig. 7. Normalized eigenvalue distribution of captured DTV signal. N s = 10 5 and N = 32. 

than EC using original R s . Again, Case 2 has the same performance with Case 1, because (l32l) and (|35l) 
are using the same statistics and they are essentially equivalent. Case 3 with feature as prior knowledge 
is about 2 dB better than Case 4 with a 2 as prior knowledge. FTM has almost the same performance 
with Case 3. Case 4 is about 1 dB better than Case 5, MME, CAV and AGM, which are all blind. It 
can be seen that for non-rank- 1 signal, AGM has almost the same performance as CAV. At -20 dB SNR, 
P d « 70% for Case 5 while only 60% and 52% for MME and CAV/AGM, respectively. At -24 dB SNR, 
however, CAV and AGM have slightly higher Pj. 

Generally speaking, among all algorithms without noise uncertainty problem, Case 3 and FTM with 
feature as prior knowledge are 2 dB better than algorithms without prior knowledge. Among all algorithms 
without prior knowledge, our GLRT based algorithm in Case 5 is slightly better than MME, CAV and 
AGM. 

VI. Conclusions 

In this paper we considered the spectrum sensing for single PU with single antenna. Received signal 
is oversampled with unknown oversampling rate and modeled as a non-white WSS Gaussian process. 
Using the concept of pattern recognition in machine learning, we defined the signal feature as the leading 
eigenvector of the signal's sample covariance matrix. Our previous work has found that signal feature is 
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Fig. 8. Algorithms with prior knowledge. Pd at various SNR levels with Pf = 10%, using captured DTV signal. 




Fig. 9. Algorithms without prior knowledge. P^ at various SNR levels with Pf — 10%, using captured DTV signal. 



robust against noise and stable over time. Both simulation and hardware experiments showed that signal 
feature can be learned blindly. In addition, by using signal feature as prior knowledge, the detection 
performance can be improved. 

Under rank-1 assumption of the signal covariance matrix, we derived several GLRT based algorithms 
for signal samples considering signal feature as one of the available parameters, as well as signal power 
and noise power. 

Rank-1 signal and captured DTV data were simulated with our derived GLRT based spectrum sensing 
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algorithms and other state-of-the-art algorithms, including MME, CAV, FTM and AGM. MME, CAV and 
AGM can be viewed as the benchmark algorithms when no prior knowledge is available, while FTM 
can be viewed as the benchmark algorithm when only signal feature is available. The simulation results 
showed that our derived GLRT based algorithms have relatively better performance than the benchmark 
algorithms under the same available prior knowledge conditions. In general, algorithms with signal feature 
as prior knowledge are about 2 dB better than the algorithms without prior knowledge, and 2 dB worse 
than EC when all parameters are prior knowledge. Interestingly, the detection performance of FTM was 
almost the same as that of our GLRT based algorithm with signal feature as prior knowledge, though 
FTM has much lower computational complexity and has already been implemented in our previous work. 

More generalized results under rank-k assumption will be discussed. New methods in [12111 . G2l will 
be applied in our framework. Spectrum sensing for multiple antennas and cooperative spectrum sensing 
will also be discussed. Moreover, we will explore more machine learning techniques for cognitive radio, 
including robust principal component analysis [|29l , fast low-rank approximations [1241 . manifold learning 
11301 , etc. 
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